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This Chapter is written for the Festschrift celebrating the 70th birthday of the distinguished 
economist Duncan Foley from the New School for Social Research in New York. This Chapter 
reviews applications of statistical physics methods, such as the principle of entropy maximization, 
to the probability distributions of money, income, and global energy consumption per capita. 
The exponential probability distribution of wages, predicted by the statistical equilibrium 
theory of a labor market developed by Foley in 1996, is supported by empirical data on 
income distribution in the USA for the majority (about 97%) of population. In addition, the 
upper tail of income distribution (about 3% of population) follows a power law and expands 
dramatically during financial bubbles, which results in a significant increase of the overall 
income inequality. A mathematical analysis of the empirical data clearly demonstrates the 
two-class structure of a society, as pointed out Karl Marx and recently highlighted by the Occupy 
Movement. Empirical data for the energy consumption per capita around the world are close 
to an exponential distribution, which can be also explained by the entropy maximization principle. 

"Money, it's a gas." Pink Floyd, Dark Side of the Moon 



I. HOW I MET DUNCAN FOLEY 

Although I am a theoretical physicist, I have been al- 
ways interested in economics and, in particular, in appli- 
cations of statistical physics to economics. These ideas 
first occurred to me when I was an undergraduate student 
at the Moscow Physical- Technical Institute in Russia and 
studied statistical physics for the first time. However, it 
was not until 2000 when I published my first paper on 
this subject (Dragulescu and Yakovenko 20001, joining 



the emerging movement of econophysics (Farmer, Shu- 



bik, and Smith 2005 Hogan 2005 Shea 2005 ). At that 



time, I started looking for economists who may be inter- 
ested in a statistical approach to economics. I attended 
a seminar by Eric Slud, a professor of mathematics at 
the University of Maryland, who independently explored 
similar ideas and eventually published them in |Silver,| 
Slud, and Takamoto (2002). In this paper, I saw a ref- 
erence to the paper by Foley (1994). So, I contacted 



Duncan in January 2001 and invited him to give a talk 
at the University of Maryland, which he did in March. 
Since then, our paths have crossed many times. I visited 
the New School for Social Research in New York several 
times, and we also met and discussed at the Santa Fe In- 
stitute, where I was spending a part of my sabbatical in 
January-February 2009, hosted by Doyne Farmer. Dur- 
ing these visits, I also met other innovative economists 
at the New School, professors Anwar Shaikh and Willi 
Semler, as well as Duncan's Ph.D. student at that time 
Mishacl Milakovic, who is now a professor of economics 
at the University of Bamberg in Germany. 



As it is explained on the first page of Foley (1994), 



Duncan learned statistical physics by taking a course on 
statistical thermod ynami c s at Dartmouth C ollege. For 
me, the papers by Foley (1994 1996 19991 conjure an 



image of people who meet on a bridge trying to reach for 
the same ideas by approaching from the opposite banks, 
physics and economics. Over time, I learned about other 
papers where economists utilize statistical and entropic 
ideas, e.g. 



Aoki and Yoshikawa (20061; Golan (19941; 



Molico (2006), but I also realized how exceedingly rare 



statistical approach is among mainstream economists. 
Duncan is one of the very few innovative economists who 
has deeply studied statistical physics and attempted to 
make use of it in economics. 

One of the puzzling social problems is persistent eco- 
nomic inequality among the population in any society. 
In statistical physics, it is well known that identical 
("equal") molecules in a gas spontaneously develop a 
widely unequal distribution of energies as a result of ran- 
dom energy transfers in molecular collisions. By analogy, 
very unequal probability distributions can spontaneously 
develop in an economic system as a result of random in- 
teractions between economic agents. This is the main 
idea of the material presented in this Chapter. 

First, I will briefly review the basics of statistical 
physics and then discuss and compare the applications 



of these ideas to economics in |Foley| ( |1994 
and in 



my papers, from the first paper (Dragulescu 



1996 1999) 



and Yakovenko _ 2000 ) to the most r e cent review pa- 
pers ( Banerjee and Yakovenko 2010 Yakovenko and 



Rosser 2009) and books (Cockshott, Cottrell, Michael- 



son, Wright, and Yakovenko| |2009| |Yakovenko 20111. As 



we shall see, statistical and entropic ideas have a multi- 
tude of applications to the probability distributions of 
money, income, and global energy consumption. The 
latter topic has relevance to the ongoing debate on the 



economics of global warming (Rezai, Foley, and Taylor 
2012). Duncan Foley and Eric Smith of the Santa Fe 



Institute have also published a profound study of the 
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FIG. 1 A quantum paramagnet, where each atom has the 
spin s = 1/2 and is in one of the two possible states (q = 2): 
k — 1 =t or k = 2 The atoms are labeled consecutively 
by the integer index j = 1, 2, . . . , N. 



connection between phenomenological thermodynamics 
in physics and the utility formalism in economics ( Smith 



and Foley 20081. This paper will not be reviewed here 
because of a limited volume of this Chapter. Unfortu- 
nately, there is still no complete understanding of a con- 
nection between statistical mechanics and phenomeno- 
logical thermodynamics as applied to economics. These 
issues remain open for a future study. I will only focus 
on statistical mechanics in this Chapter. 



II. ENTROPY AND THE BOLTZMANN-GIBBS 
DISTRIBUTION OF ENERGY IN PHYSICS 

In this section, I briefly review the basics of statisti- 
cal physics, starting from a discrete model of a quantum 
paramagnet. Let us consider a collection of N atoms 
and label the individual atoms with the integer index 
j = 1, 2, . . . , N. It is convenient to visualize the atoms as 
sitting on a lattice, as shown in Fig. [I] Each atom has 
an internal quantum degree of freedom called the spin s 
and can be in one of the q discrete states. 1 These states 

are labeled by the index k = 1,2 q. The system is 

placed in a external magnetic field, so each discrete state 
has a different energy £&. Then, the state of an atom j 
can be characterized by the energy Sj that this atom has. 
(Throughout the paper, I use the indices i and j to label 
individual atoms or agents, and the index k to label pos- 
sible states of the atoms or agents.) A simple case with 
q = 2 is illustrated in Fig. [TJ where each atom can be 
in one of the two possible states k = 1 =t or k = 2 =|, 
which are depicted as the spin-up and spin-down states. 
Alternatively, one can interpret each atom as a bit of 
computer memory with two possible states and 1. For 
q = 8, each atoms would correspond to a byte with 8 
possible values. The atoms can be also interpreted as 
economic agents, where the index k indicates an internal 
state of each agent. 

For a given configuration of atoms, let us count how 
many atoms are in each state k and denote these numbers 
as N k , so that J2l=i Nk = N. Let us introduce the 
multiplicity fi, which is the number of different atomic 
configurations that have the same set of the numbers 



Nk- Calculating Q, we treat the atoms as distinguishable, 
because they are localized on distinguishable lattice sites 
labeled by the index j in Fig. [T] For example, let us 
consider the case of N = 2 and q = 2, i.e. two atoms 
with two possible states. The set N% = 1 and N2 — 
1 can be realized in two possible ways, so fi = 2. In 
one configuration, the first atom j = 1 is in the state 
k = 1 =t with the energy £1 = £f , whereas the second 
atom j = 2 is in the state k — 2 =J, with the energy 
£2 = £4.. In another configuration, the states of the atoms 
are reversed. On the other hand, the set N^ = 2 and 
N% = can be realized in only one way, where both atoms 
are in the state k — 1 —f with the energies £1 = e% = Ef, 
so ft = 1. In general, the multiplicity £1 can be calculated 
combinatorially as the number of different placements of 
N atoms into q boxes with the fixed occupations Nk of 
each box: 
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(1) 



The logarithm of multiplicity is called the entropy S — 
InQ. Using the Stirling approximation for the factorials 
in the limit of large numbers Nk , we obtain 



S = N\nN - N k \nN k = - £ N k 



hi 



fc=i 



fc=i 



N 



(2) 



In the absence of further information, we assume that 
all microscopic configurations of atoms are equally prob- 
able. Then, the probability of observing a certain set of 
the numbers Nk is proportional to the number of possible 
microscopic realizations of this set, i.e. to the multiplic- 
ity il. So, the most probable set of the numbers A^. is 
the one that maximizes Q or S subject to certain con- 
straints. The typical constraints for a closed system are 
that the total number of atoms N and the total energy 
E are fixed: 



A^ 
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fe=i 
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(3) 



To implement the constraints, we introduce the Lagrange 
multipliers a and ft and construct the modified entropy 



S = S + a^N l 



fe=i 



1 

1 -pj2 £kNk - 

fc=i 



(4) 



Maximization of S is achieved by setting the derivatives 
dS/dNk to zero for each A^,. Substituting Eq. (ji]) into 
Eq. Q and taking the derivatives, 2 we find that the oc- 
cupation numbers P{£k) = Nk/N for the states k depend 
exponentially on the energies of these states £& 



Nk 
N 



-(e„-/*)/T 



(5) 



1 In quantum mechanics, s takes integer or half-integer values, and 
q = 2s + 1, but this is not important here. 



2 Notice that N = ^ fe Nk in Eq. (jSJl should be also differentiated 
with respect to Nk- 
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Here the parameters T — 1/(3 and \i = aT are called 
the temperature and the chemical potential. The values 
of a and (3 are determined by substituting Eq. ([5| into 
Eq. Q and satisfying the constraints for given N and E. 
The temperature T is equal to the average energy per 
particle T ~ (e) = E/N, up to a numerical coefficient of 
the order of one. The interpretation of Eq. ^ is that 
the probability for an atom to occupy a state with the 
energy e depends exponentially on the energy: P{e) oc 
exp(-e/T). 

This consideration can be generalized to the case where 
the label k becomes a continuous variable. In this case, 
the sums in Eqs. and Q are replaced by inte- 

grals with an appropriate measure of integration. The 
typical physical example is a gas of atoms moving in- 
side a large box. In this case, the atoms have closely- 
spaced quantized energy levels, and the number of quan- 
tum states within a momentum interval Ap and a space 
interval Ax is ApAx/h, where h is the Planck constant. 
Then, the measure of integration in Eqs. Q, |3]), and 
Q becomes the element of volume of the phase space 
dpdx/h in one-dimensional case and d 3 pd 3 r /h 3 in three- 
dimensional case. The energy in Eq. ([5]) becomes the 
kinetic energy of an atom e = p 2 /2m, which is bounded 
from below s > 0. 

Eq. ([5]) is the fundamental law of equilibrium statis- 



tical physics (Wannier 
Gibbs distribution. 13 



1987|), known as the Boltzmann- 
one can see that the above 



However, 

derivation is really an exercise in theory of probabilities 
and, as such, is not specific to physics. Thus, it can be 
applied to statistical ensembles of different nature, sub- 
ject to constraints similar to Eq. pi). Interpreting the 
discrete states of the atoms as bits of information, one 
arrives to the Shannon entropy in the theory of informa- 
tion. Entropy concepts have been applied to such dis- 



parate fields as ecology ( 


Banavar, Maritan, and Volkov 


2010| |Kelly, Blundell, Bowler, Fox, Harvey, Lomas, and 


Woodward 2011 


) and neuroscience ( Varshney, Sjostrom, 


and Chklovskii 2006| Wen, Stepanyants, Elston, Gros- 


berg, and Chklovskii 


2009 


1. 



The economy is also a big statistical system consist- 
ing of a large number of economic agents. Thus, it is 
tempting to apply the formalism presented above to the 
economy as well. However, there are different ways of 
implementing such an analogy. In the next section, I 



briefly summarize an analogy developed in Foley ( 1994 
p96lp99l ). 



3 In physics, the elementary particles are indistinguishable and 
obey the Fermi-Dirac or Bose-Einstein statistics, rather than the 
Boltzmann statistics. In contrast, the economic agents are distin- 
guishable because of their human identity. Thus, the Boltzmann 
statistics for distinguishable objects is appropriate in this case. 



III. STATISTICAL EQUILIBRIUM THEORY OF 
MARKETS BY DUNCAN FOLEY 



In Foley (1994), Duncan proposed a statistical equi- 
librium theory of markets. This theory is an alternative 
to the conventional competitive equilibrium theory orig- 
inated by Walras and by Marshall. In the conventional 
theory, an auctioneer collects orders from buyers and sell- 
ers and determines an equilibrium price that clears the 
market, subject to budget constraints and utility prefer- 
ences of the agents. In this theory, all transactions take 
place at the same price, but it is not quite clear how the 
real markets would actually converge to this price. In 
contrast, Duncan proposed a theory where is a probabil- 
ity distribution of trades at different prices, and market 
clearing is achieved only statistically. 

iFoley ( 1994 ) studied an ensemble of N economic agents 



who trade n types of different commodities. The state of 
each agent j = 1, 2, 3, . . . , N is characterized by the n- 



component vector a%- 



» 



). Each 



"3 - 1 > >••_•) x j 

component of this vector represents a possible trade that 
the agent j is willing to perform with a given commodity. 
Positive, negative, and zero values of Xj represent an in- 
crease, a decrease, and no change in the stock of a given 
commodity for the agent j. All trades in the system are 
subject to the global constraint 



0. 



(6) 



which represents conservation of commodities in the pro- 
cess of trading, i.e. commodities are only transferred be- 
tween the agents. An increase x% > of a commodity 
stock for an agent i must be compensated by a decrease 
Xj < for another agent j, so that the algebraic sum Q 
of all trades is zero. However, Eq. ^ does not require 
bilateral balance of transactions between pairs of agents 
and allows for multilateral trades. 

The vector Xj can be considered as an n-dimensional 
generalization of the variable Ej introduced in Sec. [Hi to 
characterize the state of each atom. Different kinds of 
trades Xk are labeled by the index k, and the number of 
agents doing the trade x^ is denoted as Nk- Then the 
constraint ^ can be rewritten as 



V, x k N k 

' * k 



0. 



(7) 



For a given set of Nk , the multiplicity (JTJ) gives the num- 
ber of different assignments of individual agents to the 
trades Xk, such that the numbers Nk are fixed. Maxi- 
mizing the entropy ([2]) subject to the constraints ([7]), we 
arrive to an analog of Eq. ([5| , which now has the form 



P(x k ) 



Nk 
N 



ce 



(8) 



Here c is a normalization constant, and 7t is the n- 
componcnt vector of Lagrange multipliers introduced to 
satisfy the constraints Q . |Foley| ( |1994[ ) interpreted n as 
the vector of entropic prices. The probability for an agent 
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to perform the set of trades x depends exponentially on 
the volume of the trades: P(x) oc exp(— 7r ■ x). 



([7]) can be rewritten as 



In Foley (19961, the general theory (Foley 19941 was 
applied to a simple labor market. In this model, there 
are two classes on agents: employers (firms) and employ- 
ees (workers). They trade in two commodities (n = 2): 
= w is wage, supplied by the firms and taken by the 
workers, and x*- 2 -* = I is labor, supplied by the workers 
and taken by the firms. For each worker, the offer set 
includes the line x = (w > Wq,— 1), where —1 is the 
fixed offer of labor in exchange for any wage w greater 
than a minimum wage wq. The offer set also includes 
the point x = (0, 0), which offers no labor and no wage, 
i.e. the state of unemployment. For each firm, the offer 
set is the line x = (—K,l > Iq), where —K is the fixed 
amount of capital spent on paying wages in exchange for 
the amount of labor I greater than a minimum value Zo- 
A mathematically similar model was also developed 
in Foley ( |1999[ ) as a result of conversations with Perry 
Mehrling. This model also has two commodities (n = 2), 
one of which is interpreted as money and another as a 
financial asset, such as a bond or a treasury bill. 

Let us focus on the labor market model by |Foley| 
(1996). According to Eq. (JsJ) , the model predicts the 



exponential probability distributions for the wages w re- 
ceived by the workers and for the labor I employed by 
the firms. Let us try to compare these predictions with 
empirical data for the real economy. The probability dis- 
tribution of wages can be compared with income distri- 
bution, for which a lot of data is available and which will 
be discussed in Sec. [VlJ On the other hand, the distribu- 
tion of labor employed by the firms can be related to the 
distribution of firm sizes, as measured by their number 



of employees. However, Foley (1996) makes an artificial 



simplifying assumption that each firm spends the same 
amount of capital K on labor. This is a clearly unre- 
alistic assumption because of the great variation in the 
amount of capital among the firms, including the capital 
spent on labor. Thus, let us focus only on the probability 
distribution of wages, unconditional on the distribution 
of labor. To obtain this distribution, we take a sum in 
Eq. (|8| over the values of x^ 2 \ while keeping a fixed value 
for ajW. In physics jargon, we "integrate out" the degree 
of freedom x^ — I and thus obtain the unconditional 
probability distribution of the remaining degree of free- 
which is still exponential 



dom x^ = w 



P(w) 



ce 



-w/T w 



(9) 



Here c is a normalization constant, and T w = l/ir 1 -- 1 * 1 is 
the wage temperature. 

Now let us discuss how the constraint ^ is satisfied 
with respect to wages. The model has Nf firms, which 
supply the total capital for wages W = KNf, which en- 
ters as a negative term into the sum ([7]). Since only the 
total amount matters in the constraint, we can take W 
as an input parameter of the model. Given that the un- 
employed workers have zero wage w = 0, the constraint 



E 



w k >w 



w k N k = W, 



(10) 



where the sum is taken over the employed workers, whose 
total number we denote as N e . The average wage per em- 
ployed worker is (w) = W/N e . Using Eq. ^ and replac- 
ing summation over k by integration over w in Eq. (10 1, 
we relate (w) and T w 



W_ _ Q w P{w) dw 



T w = (w) - Wq. (11) 



So, the wage temperature T w is a difference of the average 
wage per employed worker and the minimal wage. 

The model by |Foley| ( |1996[ ) also has unemployed work- 
ers, whose number N u depends on the measure of statis- 
tical weight assigned to the state with w = 0. Because 
this measure is an input parameter of the model, one 
might as well take N u as an input parameter. I will not 
further discuss the number of unemployed and will focus 
on the distribution of wages among the employed work- 
ers. It will be shown in Sec. VI that the exponential 



distribution of wages ^ indeed agrees with the actual 
empirical data on income distribution for the majority of 
population. 



IV. STATISTICAL MECHANICS OF MONEY 



The theory presented in Sec. Ill focused on market 
transactions and identified the states of the agents with 
the possible vectors of transactions Xj . However, an anal- 
ogy with the Boltzmann-Gibbs formalism can be also de- 
veloped in a di f ferent way, as proposed by [Dragulcscu 
and Yakovenko (2000). In this paper, the states of the 



agents were identified with the amounts of a commodity 
they hold, i.e. with stocks of a commodity, rather than 
with fluxes of a commodity. This analogy is closer to the 
original physical picture presented in Sec. |Hj where the 
states of the atoms were identified with the amounts of 
energy Sj held by the atoms (i.e. the stocks of energy), 
rather than with the amounts of energy transfer in colli- 
sions between the atoms (i.e. the fluxes of energy). In this 
section, I summarize t he theory developed by |Dragulescu| 
and Yakovenkol (120001). 



Similarly to Sec. |III| let us consider an ensemble of 
N economic agents and characterize their states by the 
amounts (stocks) of commodities they hold. Let us intro- 
duce one special commodity called money, 4 whereas all 
other commodities are assumed to be physically consum- 
able, such as food, consumer goods, and services. Money 
is fundamentally different from consumable goods, be- 
cause it is an artificially created, non-consumable object. 



4 To simplify consideration, I use only one monetary instrument, 
commonly known to everybody as "money" . 
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I will not dwell on historical origins and various physical 
implementations of money, but will proceed straight to 
the modern fiat money. Fundamentally, money is bits 
of information (digital balances) assigned to each agent, 
so money represents an informational layer of the econ- 
omy, as opposed to the physical layer consisting of various 
manufactured and consumable goods. Although money 
is not physically consumable, and well-being of the eco- 
nomic agents is ultimately determined by the physical 
layer, nevertheless money plays an extremely important 
role in the modern economy. Many economic crises were 
caused by monetary problems, not by physical problems. 
In the economy, monetary subsystem interacts with phys- 
ical subsystem, but the two layers cannot transform into 
each other because of their different nature. For this rea- 
son, an increase in material production does not result in 
an automatic increase in money supply. 

Let us denote money balances of the agents j = 
1,2, . . . , N by m.j. Ordinary economic agents can only 
receive money from and give money to other agents, and 
are not permitted to "manufacture" money, e.g. to print 
dollar bills. The agents can grow apples on trees, but 
cannot grow money on trees. Let us consider an eco- 
nomic transaction between the agents i and j. When the 
agent i pays money Am to the agent j for some goods or 
services, the money balances of the agents change as 



mi — > mi = mi — Am, 
nij — » m'j = mj + Am. 



(12) 



The total amount of money of the two agents before and 
after the transaction remains the same 

m; + mj = to - + m'j, (13) 

i.e. there is a local conservation law for money. The trans- 



fer of money ( 12 ) is analogous to the transfer of energy 



in molecular collisions, and Eq. (131 is analogous to con- 



servation of energy. Conservative models of this kind are 



also studied in some economic literature (Kiyotaki and 



Wright 


1993 


Molico 


2006) 



Enforcement of the local conservation law ( 13 1 is cru- 
cial for successful functioning of money. If the agents 
were permitted to "manufacture" money, they would be 
printing money and buying all goods for nothing, which 
would be a disaster. In a barter economy, one consumable 
good is exchanged for another (e.g. apples for oranges), so 
physical contributions of both agents are obvious. How- 
ever, this becomes less obvious in a monetary economy, 
where goods are exchanged for money and money for 
goods. The purpose of the conservation law ( 12 ) is to 



ensure that an agent can buy goods from the society only 
if he or she contributed something useful to the society 
and received monetary payment for these contributions. 
Money is an accounting device, and, indeed, all account- 
ing rules are based on the conservation law (12 1. 



Unlike ordinary economic agents, a central bank or a 
central government, who issued the fiat money in the first 
place, can inject money into the economy, thus changing 



the total amount of money in the system. This process is 
analogous to an influx of energy into a system from exter- 
nal sources. As long as the rate of money influx is slow 
compared with the relaxation rate in the economy, the 
system remains in a quasi-stationary statistical equilib- 
rium with slowly changing parameters. This situation is 
analogous to slow heating of a kettle, where the kettle has 
a well defined, but slowly increasing, temperature at any 
moment of time. Following the long-standing tradition in 
the economic literature, this section studies the economy 
in equilibrium, even though the real-world economy may 
be totally out of whack. An economic equilibrium im- 
plies a monetary equilibrium too, so, for these idealized 
purposes, we consider a model where the central author- 
ities do not inject additional money, so the total amount 
of money M held by all ordinary agents in the system is 
fixed. In this situation, the local conservation law ( 13 ) 
becomes the global conservation law for money 



= M. 



(14) 



It is important, however, that we study the system in a 
statistical equilibrium, as opposed to a mechanistic equi- 
librium envisioned by Walras and Marshall. For this rea- 



son, the global constraint ( 14 1 is actually not as crucial 
for the final results, as it might seem. In physics, we 
usually start with the idealization of a closed system, 
where the total energy E in Eq. (|3| is fixed. Then, after 
deriving the Boltzmann-Gibbs distribution ([5| and the 
concept of temperature, we generalize the consideration 
to an open system in contact with a thermal reservoir at 
a given temperature. Even though the total energy of the 
open system is not conserved any more, the atoms still 
have the same Boltzmann-Gibbs distribution (J5|. Sim- 
ilarly, routine daily operations of the central bank (as 
opposed to outstanding interventions, such as the recent 
quantitative easing by the Federal Reserve Bank) do not 
necessarily spoil the equilibrium distribution of money 
obtained for a closed system. 

Another potential problem with conservation of money 
is debt, which will be discussed in more detail in Sec. [V] 
To simplify initial consideration, we do not allow agents 
to have debt in this section. Thus, by construction, 
money balances of the agents cannot drop below zero, 
i.e. TOi > for all i. Transaction (12) takes place only 



when an agent has enough money to pay the price, i.e. 
mi > Am. An agent with m, = cannot buy goods 
from other agents, but can receive money for delivering 
goods or services to them. Most econophysics models, see 
reviews b y |Chakraborti, Tokea, Patriarca, and Abergel| 
( 2011a|b l; [Yakovenko and Rosser ( 2009 ), and some eco- 
nomic models ( Kiyotaki and Wright 1993 Molico 2006) 



do not consider debt, so it is not an uncommon sim- 
plifying assumption. In this approach, the agents are 
liquidity-constrained. As the recent economic crisis bru- 
tally reminded, liquidity constraint rules the economy, 
even though some players tend to forget about it during 
debt binges in the bubbles. 
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The transfer of money in Eq. ( 12 ) from one agent to 
another represents payment for delivered goods and ser- 
vices, i.e. there is an implied covmterfiow of goods for each 
monetary transaction. However, keeping track explicitly 
of the stocks and fluxes of consumable goods would be a 
very complicated task. One reason is that many goods, 
e.g. food and other supplies, and most services, e.g. get- 
ting a haircut or going to a movie, are not tangible and 
disappear after consumption. Because they are not con- 
served, and also because they are measured in many dif- 
ferent physical units, it is not practical to keep track of 
them. In contrast, money is measured in the same unit 
(within a given country with a single currency) and is 
conserved in local transactions (13 1, so it is straightfor- 
ward to keep track of money. 

Thus, I choose to keep track only of the money bal- 
ances of the agents rrij , but not of the other commodities 
in the system. This situation can be visualized as follows. 
Suppose money accounts of all agents are kept on a cen- 
tral computer server. A computer operator observes the 
multitude of money transfers ( 12 ) between the accounts, 



but he or she does not have information about the reasons 
for these transfers and does not know what consumable 
goods were transferred in exchange. Although purpose- 
ful and rational for individual agents, these transactions 
look effectively random for the operator. (Similarly, in 
statistical physics, each atom follows deterministic equa- 
tions of motion, but, nevertheless, the whole system is 
effectively random.) Out of curiosity, the computer op- 
erator records a snapshot of the money balances rrij of 
all agents at a given time. Then the operator counts 
the numbers of agents Nk who have money balances in 
the intervals between rrik and rrik + m* , where is a 
reasonably small money window, and the index fc labels 
the money interval. The operator wonders what is the 
general statistical principle that governs the relative oc- 
cupation numbers P(rrik) = Nk/N. In the absence of 
additional information, it is reasonable to derive these 
numbers by maximizing the multiplicity f2 and the en- 
tropy S of the money distribution in Eqs. and ([2 1, 
subject to the constraint J2k m k^k — M in Eq. (14 1. 



The result is given by the Boltzmann-Gibbs distribution 
for money 



P(m) 



-m/Trr, 



(15) 



where c is a normalizing constant, and T m is the money 
temperature. Similarly to Eq. (11), the temperature is 
obtained from the constraint ( 14 ) and is equal to the 
average amount of money per agent: T m = (m) = M/N. 

The statistical approach not only predicts the equilib- 
rium distribution of money ( |15[ ), but also allows us to 
study how the probability distribution of money P(m, t) 
evolves in time t toward the equilibrium. Dra gulescu| 
and Yakovenko (2000) performed computer simulations 



of money transfers between the agents in different mod- 
els. Initially all agents were given the same amount of 
money, say, $1000. Then, a pair of agents was ran- 
domly selected, the amount Am was transferred from 
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FIG. 2 Histogram and points: The stationary probability dis- 
tribution of money P(m) obtained in computer simulations 



of the random transfer models (Dragulescu and Yakovenko 
2000 1. Solid curves: Fits to the exponential distribution ( 15 1. 



Vertical line: The initial distribution of money. 



one agent to another, and the process was repeated many 
times. Time evolution of the probability distribution of 
money P{m, t) is shown in computer animation videos 
by |Chen and Yakovenko| p007| ; |Wright| p007| ). After 
a transitory period, money distribution converges to the 
stationary form shown in Fig. [2j As expected, the distri- 
bution is well fitted by the exponential function (15). 



In the simplest model considered by [Dragulesc u and| 
Yakovenko (2000), the transferred amount Am = $1 was 



constant. Computer animation (Chen and Yakovenko 



2007) shows that the initial distribution of money first 
broadens to a symmetric Gaussian curve, typical for a 
diffusion process. Then, the distribution starts to pile up 
around the m — state, which acts as the impenetrable 
boundary, because of the imposed condition m > 0. As a 
result, P(m) becomes skewed (asymmetric) and eventu- 
ally reaches the stationary exponential shape, as shown 
in Fig. [2j The boundary at m = is analogous to the 
ground-state energy in physics, i.e. the lowest possible 
energy of a physical system. Without this boundary 
condition, the probability distribution of money would 
not reach a stationary shape. Computer animations 
(|Chen and Yakovenko[ |2007| |Wright[ |2007|) also show 



how the entropy M of the money distribution, defined 
as S/N = — J2 k P[mk) In P(mfc), grows from the initial 
value S — 0, where all agents have the same money, to 
the maximal value at the statistical equilibrium. 

Dragulescu and Yakovenko ( |2000 ) performed simula- 
tions of several models with different rules for money 
transfers Am. As long as the rules satisfy the time- 



reversal symmetry (Dragulescu and Yakovenko 2000), 



the stationary distribution is always the exponential one 
( 15 ), irrespective of initial conditions and details of trans- 
fer rules. However, this symmetry is violated by the 
multiplicative rules of transfer, such as the proportional 
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rule Am = jrrii (Angle 1986 Ispolatov, Krapivsky, and 



Chakrabart 
20061 



Redner 1998 >, the saving propensity ( Chakraborti and 



These 



2000), and the negotiable prices (Molico 



models produce Gamma-like distributions, 
as well as a power-law tail for a random distribution of 
saving propensities (Chatterjee and Chakrabarti 2007). 
Despite some mathematical differences, all these models 
demonstrate spontaneous development of a highly un- 
equal probability distribution of money as a result of ran- 



dom money transfers between the agents ( Chakraborti, 
Tokea, Patriarca, and Abergel, 2011a|b ). The money 



transfer models can be also formulated in the language of 
commodity trading between the agents optimizing their 
utility functions (Chakrabarti and Chakrabarti, 20091, 



which is closer to the traditional language of the eco- 
nomic literature. More involved agent-based simulations 
were developed by Wright ( 2005 2009 ) and demonstrated 



emergence of a two-classes society from the initially equal 
agents. This work was further developed in the book by 



Cockshott, Cottrell, Michaelson, Wright, and Yak ovenko| 



(2009), integrating economics, computer science, and 



physics. Empirical data on income distribution discussed 
in Sec. |VI| show direct evidence for the two-class society. 

Let us compare the results of Sees. |III| and |IV| In 
the former, the Boltzmann-Gibbs formalism is applied 
to the fluxes of commodities, and the distribution of 
wages ^ is predicted to be exponential. In the latter, 
the Boltzmann-Gibbs formalism is applied to the stocks 
of commodities, and the distribution of money balances 



(15) is predicted to be exponential. There is no contra- 



diction between these two approaches, and both fluxes 
and stocks may have exponential distributions simulta- 
neously, although this is not necessarily required. These 
papers were motivated by differ ent questions. The pa- 
pers by [Foley] (119941 119961 119991) were motivated by the 



long-standing question of how markets determine prices 
between different commodities in a decentralized man- 



ner. The paper by Dragulescu and Yakovenko (2000) 



was motivated by another long-standing question of how 
inequality spontaneously develops among equal agents 
with equal initial endowments of money. In the neoclassi- 
cal thinking, equal agents with equal initial endowments 
should forever stay equal, which contradicts everyday ex- 
perience. Statistical approach argues that the state of 
equality is fundamentally unstable, bec ause it has a very 
low entropy. The law of probabilities (iFarjoun and Ma 



chover 1983 ) leads to the exponential distribution, which 



is highly unequal, but stable, because it maximizes the 
entropy. Some papers in the economic literature mod- 
eled the distributions of both prices and money balances 
within a common statistical framework (Molico 2006). 



It would be very interesting to compare the theoretical 
prediction ( 15 ) with empirical data on money distribu- 
tion. Unfortunately, it is very difficult to obtain such 
data. The distribution of balances on deposit accounts 
in a big enough bank would be a reasonable approxima- 
tion for the distribution of money among the population. 
However, such data are not publicly available. In con- 



trast, plenty of data on income distribution are available 
from the tax agencies and surveys of population. These 
data will be compared with the theoretical prediction Q 
in Sec. |VI| However, before that, I will discuss in the next 
section how the results of this section would change when 
the agents are permitted to have debt. 



V. MODELS OF DEBT 

From the standpoint of individual economic agents, 
debt may be considered as negative money. When an 
agent borrows money from a bank, 5 the cash balance 
of the agent (positive money) increases, but the agent 
also acquires an equal debt obligation (negative money), 
so the total balance (net worth) of the agent remains 
the same. Thus, the act of money borrowing still sat- 
isfies a generalized conservation law of the total money 
(net worth) , which is now defined as the algebraic sum of 
positive (cash M) and negative (debt D) contributions: 
M — D = Mf,, where Mb is the original amount of money 



in the system, the monetary base ( |McConnell and Brue 



1996). When an agent needs to buy a product at a price 
Am exceeding his money balance mi, the agent is now 
permitted to borrow the difference from a bank. After 
the transaction, the new balance of the agent becomes 
negative: m' = rrij — Am < 0. The local conservation 



law ( 12 ) and ( 13 ) is still satisfied, but now it involves neg- 
ative values of m. Thus, the consequence of debt is not 
a violation of the conservation law, but a modification of 
the boundary condition by permitting negative balances 
m, < 0, so m = is not the ground state any more. 

If the computer simulation with Am = $1 described 
in Sec. |IV| is repeated without any restrictions on the 
debt of the agents, the probability distribution of money 
P(m, t) never stabilizes, and the system never reaches a 
stationary state. As time goes on, P(m, t) keeps spread- 
ing in a Gaussian manner unlimitedly toward m = +oo 
and m = — oo. Because of the generalized conservation 
law, the first moment of the algebraically defined money 
m remains constant (m) = M^/N. It means that some 
agents become richer with positive balances m > at 
the expense of other agents going further into debt with 
negative balances m < 0. 

Common sense, as well as the current financial crisis, 
indicate that an economic system cannot be stable if un- 
limited debt is permitted. In this case, the agents can 
buy goods without producing anything in exchange by 
simply going into unlimited debt. Eq. (15) is not appli- 



cable in this case for the following mathematical reason. 
The normalizing coefficient in Eq. (15) is c = 1/Z, where 



5 Here we treat the bank as being outside of the system consisting 
of ordinary agents, because we arc interested in money distribu- 
tion among these agents. The debt of the agents is an asset for 
the b ank, and deposits of cash into the bank are liabilities of the 
bank dMcConnell and Brue||1996l. 
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FIG. 3 Histograms: Stationary distributions of money with 
and without debt (Dragulescu and Yakovenko 2000). The 



debt is limited to = 800. Solid curves: Fits to the expo- 
nential distributions with the money temperatures T m = 1800 
and T m = 1000. 



FIG. 4 The stationary distributio n of money for th e required 
reserve ratio R = 0.8 ( |Xi, Ding, and Wang| |2005| ). The dis- 
tribution is exponential for both positive and negative money 
with different temperatures T+ and T_ , as shown in the inset 
on log-linear scale. 



Z is the partition function 



(16) 



and the sum is taken over all permitted states k of the 
agents. When debt is not permitted, and money balances 
are limited to non-negative values m/s > 0, the sum in 
Eq. ( 16 ) converges, if the temperature is selected to be 
positive T m > 0. This is the case in physics, where ki- 
netic energy takes non-negative values e > 0, and there is 
a ground state with the lowest possible energy, i.e. energy 
is bounded from below. 6 In contrast, when debt is per- 
mitted and is not limited by any constraints, the sum in 



the partition function ( 16 ) over both positive and nega- 



tive values of diverges for any sign of the temperature 
T 

It is clear that some sort of a modified boundary con- 
dition has to be imposed in order to prevent unlimited 
growth of debt and to ensure overall stability and equi- 
librium in the system. Dragulescu and Yakovenkol ( 2000 ) 



considered a simple model where the maximal debt of 
each agent is limited to mj. In this model, P{m) again 
has the exponential shape, but with the new boundary 
condition at m = -m^ and the higher money tempera- 
ture Tfj = + Mfe/iV, as shown in Fig. [3j This result is 
analogous to Eq. (fill) with the substitution wq — > — 1714. 



6 When the energy spectrum is bounded both from above and 
below, the physical temperature T may take either positive or 
negative values. For a quantum paramagnet discussed in Scc|TT] 
the negative temperature T corresponds to the inverse popula- 
tion, where the higher energy levels Eh have higher population 
N), than the lower energy levels. 



By allowing the agents to go into debt up to m<j, we ef- 
fectively increase the amount of money available to each 
agent by m d . 



Xi, Ding, and Wang (2005) considered a more realis- 



tic boundary condition, where a constraint is imposed on 
the total debt of all agents in the system, rather than on 
individual debt of each agent. This is accomplished via 



the required reserve ratio R (McConnell and Brue 1996 1. 



Banks are required by law to set aside a fraction R of 
the money deposited into bank accounts, whereas the re- 
maining fraction 1 — R can be lent to the agents. If the 
initial amount of money in the system (the money base) 
is Mb, then, with repeated lending and borrowing, the 
total amount of positive money available to the agents in- 
creases up to M — Mb /R, where the factor 1 /R is called 
the money multiplier (McConnell and Brue 1996). This 



is how "banks create money". This extra money comes 
from the increase in the total debt D — Mb/R — Mb of 
the agents. Now we have two constraints: on the pos- 
itive money M and on the maximal debt D. Applying 
the principle of maximal entropy, we find two exponen- 
tial distributions, for positive and negative money, char- 
acterized by two different temperatures, T + = Mb/RN 
and T_ = Mb(l — R)/RN . This result was confirmed in 
computer simulations by Xi, Ding, and Wang (2005), as 
shown in Fig. [4] 

However, in the real economy, the reserve requirement 
is not effective in stabilizing debt in the system, because 
it applies only to retail banks insured by FDIC, but not 
to investment banks. Moreover, there are alternative in- 
struments of debt, including derivatives, credit default 
swaps, and various other unregulated "financial innova- 
tions". As a result, the total debt is not limited in prac- 
tice and can reach catastrophic proportions. Dragulescu 
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and Yakovenko ( 2000 ) studied a model with different in- 
terest rates for deposits into and loans from a bank, but 
without an explicit limit on the total debt. Computer 
simulations by |Dragulescu and Yakovenko ([2000 ) show 
that, depending on the choice of parameters, the total 
amount of money in circulation either increases or de- 
creases in time, but never stabilizes. The interest am- 
plifies the destabilizing effect of debt, because positive 
balances become even more positive and negative even 
more negative. Arguably, the current financial crisis was 
caused by the enormous debt accumulation, triggered by 
subprimc mortgages and financial derivatives based on 
them. The lack of restrictions was justified by a mis- 
guided notion that markets will always arrive to an equi- 
librium, as long as government does not interfere with 
them. However, an equilibrium is possible only when 
the proper boundary conditions are imposed. Debt does 
not stabilize by itself without enforcement of boundary 
conditions. 

Bankruptcy is a mechanism for debt stabilization. In- 
terest rates are meaningless without a mechanism for 
triggering bankruptcy. Bankruptcy erases the debt of 
an agent (the negative money) and resets the balance 
to zero. However, somebody else (a bank or a lender) 
counted this debt as a positive asset, which also be- 
comes erased. In the language of physics, creation of 
debt is analogous to particle-antiparticle generation (cre- 
ation of positive and negative money), whereas cancella- 
tion of debt corresponds to particle-antiparticle annihila- 
tion (annihilation of positive and negative money) . The 
former and the latter dominate during booms and busts 
and correspond to monetary expansion and contraction. 

Besides fiat money created by central governments, 
numerous attempts have been made to create alterna- 



tive community money from scratch (Kichiji and Nishibe 



2007). In such a system, when an agent provides goods 
or services to another agent, their accounts are credited 
with positive and negative tokens, as in Eq. (12 1. Be- 
cause the initial money base Mb = is zero in this case, 
the probability distribution of money P(m) is symmetric 
with respect to positive and negative m, i.e. "all money 
is credit". Unless boundary conditions are imposed, the 
money distribution P(m, i) would never stabilize in this 
system. Some agents would try to accumulate unlimited 
negative balances by consuming goods and services and 
not contributing anything in return, thus undermining 
the system. 



VI. EMPIRICAL DATA ON INCOME DISTRIBUTION 

In this section, the theoretical prediction (|9| for the 
probability distribution of wages will be compared with 
the empirical data on income distribution in the USA. 
The data from the US Census Bureau and the Internal 
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FIG. 5 Histogram: Probability distribution of individual in 
come from the US Cens us Bureau data for 1996 (Dragulescu 
Solid line: 



and Yakovenko 2001a I. Solid line: Fit to the exponential 
law. Inset plot A: The same with the logarithmic vertical 
scale. Inset plot B: Cumulative probability distribution of 
individual income from the PSID data for 1992. 



tribution, and the wage temperature T w = 20.3 k$/year 
for 1996 is obtained from the fit. Eq. Q also has the 
parameter wg, which represents the minimal wage. Al- 
though the data deviate from the exponential distribu- 
tion in Fig. [5] for incomes below 5 k$/year, the proba- 
bility density at w — is still non-zero, -P(O) 7^ 0. One 
would expect difficulties in collecting reliable data for 
very low incomes. Given limited accuracy of the data 
in this range and the fact that the deviation occurs well 
below the average income T Wl we can set wq — for the 
practical purposes of fitting the data. Although there is 
a legal requirement for minimal hourly wage, there is no 
such requirement for annual wage, which depends on the 
number of hours worked. 

The upper limit of the income data analyzed by 



Dragulescu and Yakovenko 


( 


2001a 


) was about 120 


k$/year. Subsequently, 


Dragulescu and Yakovenko 



(2001b 2003) analyzed income data up to 1 M$/year 



Revenue Service (I RS) were analyzed by Dragulescu and 
Yakovenko (2001a). One plot from this paper is shown in 
Fig. [5j The data agree very well with the exponential dis- 



and found that the upper tail of the distribution fol- 
lows a power law. Thus, income distribution in the 
USA has a two-class structure, as shown in Fig. [6] 
This figure shows the cumulative distribution function 
C(r) = P(r') dr' , where P(r) is the probability den- 
sity, and the variable r denotes income. When P(r) is 
an exponential or a power-law function, C{r) is also an 
exponential or a power-law function. The straight line 
on the log- linear scale in the inset of Fig. [6] demonstrates 
the exponential Boltzmann-Gibbs law for the lower class, 
and the straight line on the log-log scale in the main panel 
illustrates the Pareto power law for the upper class. The 
intersection point between the exponential and power-law 
fits in Fig. [6] defines the boundary between the lower and 
upper classes. For 1997, the annual income separating 
the two classes was about 120 k$. About 3% of the pop- 
ulation belonged to the upper class, and 97% belonged to 
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United States, IRS data for 1997 
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FIG. 6 Cumulative probability distribution of tax returns for 
USA in 1997 shown on log-log (main panel) a nd log-linear 
(inset) scales (Dragulescu and Yakovenko 20031. Points rep- 
resent the IRS data, and solid lines are fits to the exponential 
and power-law functions. 
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FIG. 7 Cumulative probability distributions of tax returns 
for 1983-2001 plotted on log-log scale versus r/T r (the an- 
nual income r normalized by the average income T r in the 



expon ential part of the distribution) (Silva and Yakovenko 
20051. The columns of numbers give the values of T r for the 



corresponding years. 



the lower class. Although the existence of social classes 
has been known since Karl Marx, it is interesting that 
they can be straightforwardly identified by fitting the 
empirical data with simple mathematical functions. The 
exponential distribution ([9| applies only to the wages of 
workers in the lower class, whereas the upper tail repre- 
sents capital gains and other profits of the firms owners. 



and Yakovenko 2005 1 : 



Silva and Yakovenko ( 2005 ) studied historical evolu- 



tion of income distribution in the USA during 1983-2001 
using the IRS data. The structure of income distribu- 
tion was found to be qualitatively similar for all years, as 
shown in Fig. [7J The average income in nominal dollars 
has approximately doubled during this time interval. So, 
the horizontal axis in Fig. [7] shows the normalized income 
r/T r , where the income temperature T r was obtained by 
fitting the exponential part of the distribution for each 
year. The values of T r are shown in Fig. [7] The plots for 
the 1980s and 1990s are shifted vertically for clarity. We 
observe that the data points for the lower class collapse 
on the same exponential curve for all years. This demon- 
strates that the relative income distribution for the lower 
class is extremely stable and does not change in time, de- 
spite gradual increase in the average income in nominal 
dollars. This observation confirms that the lower class is 
in a statistical "thermal" equilibrium. 

On the other hand, Fig. [7] also shows that income dis- 
tribution of the upper class does not rescale and signifi- 
cantly changes in time. |Banerjee and Yakovenko (2010) 
found that the exponent a of the power law C(r) oc l/r a 
has decreased from 1.8 in 1983 to 1.3 in 2007, as shown 
in Panel (b) of Fig. [8j This means that the upper tail 
has become "fatter" . Another informative parameter is 
the fraction / of the total income in the system going 
to the upper class (Banerjee and Yakovenko 2010 Silva 



(r) 



(17) 



Here (r) is the average income of the whole population, 
and the temperature T r is the average income in the ex- 
ponential part of the distribution. Eq. (17) gives a well- 



defined measure of the deviation of the actual income 
distribution from the exponential one and, thus, of the 
fatness of the upper tail. Panel (c) in Fig. [8] shows his- 
torical evolution of the parameters (r), T r , and / (Baner- 
jee and Yakovenko 2010). We observe that T r has been 



increasing approximately linearly in time (this increase 
mostly represents inflation). In contrast, (r) had sharp 
peaks in 2000 and 2007 coinciding with the heights of 
speculative bubbles in financial markets. The fraction / 
has been increasing for the last 20 years and reached max- 
ima exceeding 20% in the years 2000 and 2007, followed 
by sharp drops. We conclude that speculative bubbles 
greatly increase overall income inequality by increasing 
the fraction of income going to the upper class. When 
bubbles collapse, income inequality decreases. Similar re- 
sults were found for Japan by|Aoyama, Souma, and Fu- 
Fujiwara, Souma, Aoyama, Kaizoji, and 



|jrwara| (2003 
Aokil (120031 



F 



Income inequality can be also characterized by the 



Lorenz curve and the Gini coefficient (Kakwani 1980). 



The Lorenz curve is defined in terms of the two coordi- 
nates x(r) and y(r) depending on a parameter r: 

r r ( r r' P(r') dr' 

x(r)= P(r')dr', y(r) = J ° > , . (18) 



/„ r'P(r') dr' ' 

The horizontal coordinate x(r) is the fraction of the pop- 
ulation with income below r, and the vertical coordi- 
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FIG. 8 (a) The Gini coefficient G for income distribution in 
the USA in 1983-2009 (connected line), compared with the 
theoretical formula G = (1 + /)/2 (open circles), (b) The ex- 
ponent a of the power-law tail of income distribution, (c) The 
average income (r) in the whole system, the average income T r 
in the lower class (the temperature of the exponential part), 
and the fraction of income /, Eq. ( 17 1, going to the upper tail 
(Banerjee and Yakovenko 



2010) 



nate y(r) is the fraction of the total income this popu- 
lation accounts for. As r changes from to oo, x and 
y change from to 1 and parametrically define a curve 
in the (x, y) plane. For for an exponential distribution 
P(r) = cexp(— r/T r ), the Lorenz curve is (Dragulescu 
and Yakovenko| |2001a| ) 



y 



+ (1 - x) ln(l - x). 



(19) 



Fig. [9] shows the Lorenz curves for 1983 and 2000 com 



puted from the IRS data (Silva and Yakovenko 2005 1 . 
The data points for 1983 (squares) agree reasonably well 
with Eq. ( 19 ) for the exponential distribution, which is 



shown by the upper curve. In contrast, the fraction / of 
income in the upper tail becomes so large in 2000 that 
Eq. (19 ) ha s to be modified as follows (|Dragulescu and 



Yakovenko[ |2003| |Silva and Yakovenko| [2005| 



y = (1 - f)[x + (1 - x) ln(l - x)\ + f G(x - 1). (20) 



The last term in Eq. ([20j) represents the vertical jump of 
the Lorenz curve at x = 1, where a small percentage of 
population in the upper class accounts for a substantial 
fraction / of the total income. The lower curve in Fig. [9] 
shows that Eq. (20) fits the data points for 2000 (circles) 
very well. 

The deviation of the Lorenz curve from the straight 
diagonal line in Fig. [9] is a measure of income inequality. 
Indeed, if everybody had the same income, the Lorenz 
curve would be the diagonal line, because the fraction 



90- 
80 

70 0.5, 
60 



50- 
40- 
30- 
20 



Gini from IRS data 
Gini=(1+f)/2 



..... 



1980 1985 1990 1995 2000/ 
Year : 



10 20 30 40 50 60 70 80 
Cumulative percent of tax returns 



FIG. 9 Main panel: 



Lorenz plots for income distribution in 
The data points 
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1983 and 2000 l ]Silva and Yakovenko 
are from the IRS, and the theoretical curves represent Eq. 
(201 with the parameter / deduced from Eq. ( 17 1. Inset: The 



closed circles are the IRS data for the Gini coefficient G, and 
the open circles show the theoretical formula G = (1 + /)/2. 



of income would be proportional to the fraction of the 
population. The standard measure of income inequality 
is the Gini coefficient < G < 1, which is defined as 
the area between the Lorenz curve and the diagonal line, 
divid ed by the area o f the triangle bene ath the diagonal 
line (Kakwani 1980). It was shown by Dragulescu and 



Yakovenko| ( |2001a [) that the Gini coefficient is G = 1/2 
for a purely exponential distribution. Historical evolu- 
tion of the empirical Gini coefficient is shown in the inset 
of Fig. [9] and in Panel (a) of Fig. [8] In the first ap- 
proximation, the values of G are close to the theoretical 
value 1/2. However, if we take into account the upper 
tail using Eq. ( [20] ), the formula for the Gini coefficient 
becomes G = (1 + f)/2 flSilva and Yakovenko| [2005 1. 
The inset in Fig. [9] and panel (a) in Fig. |8| show that this 
formula gives a very good fit of the IRS data starting 
from 1995 using the values of / deduced from Eq. (17). 



The values G < 1/2 in the 1980s cannot be captured by 
this formula, because the Lorenz data points for 1983 lie 
slightly above the theoretical curve in Fig. [9] We con- 
clude that the increase of income inequality after 1995 
originates completely from the growth of the upper-class 
income, whereas income inequality within the lower class 
remains constant. 

So far, we discussed the distribution of individual in- 
come. An interesting related question is the distribution 
-P2M of family income r = r±+r2, where r\ and r% are the 
incomes of spouses. If individual incomes are distributed 
exponentially P(r) cx exp(— r/T r ), then 

P 2 (r)= [ dr'P(r')P(r-r , ) = crcxp{-r/T r ), (21) 
Jo 

where c is a normalization constant. Fig. |10| shows that 
Eq. (21 ) is in good agreement with the family income dis- 
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United States, Bureau of Census data for 1996 
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FIG. 10 Histogram: Probability distribution of family in- 
come for families with two adults, US Census Bureau data 
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Dragulescu and Yakovenko 2001a). Solid line: Fit to Eq 



United States, Bureau of Census data for 1947-1994 
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FIG. 11 Main panel: Lorenz plot for family income, calcu- 
lated from Eq. (21 1 and compared with the US Census Bu- 



reau data points for 1947-1994 (Dragulescu and Yakovenko 
2001a I. Inset: Data points from the US Census Bureau for 



the Gini coefficient for families, compared with the theoretical 
value 3/8=37.5%. 



tribution data from the US Census Bureau (D ragulescu 
and Yakovenko 2001a). It is assumed in Eq. (21) that 



incomes of spouses are uncorrected. This simple ap- 
proximation is indeed supported by the scatter plot of 
i ncom es of spouses shown in Dragule scu and Yakovenko] 
(2003). The Gini coefficient for the family income dis- 



tributional) wa s analy tically calculated by Dragulescu 
and Yakovenko] ( ]2001a| ) as G = 3/8 = 37.5%T Fig. 
11| shows the Lorenz quintiles and the Gini coefficient 
for 1947-1994 plotted from the US Census Bureau data 
(Dragulescu and Yakovenko 2001a). The solid line, rep- 



resenting the Lorenz curve calculated from Eq. (21), is 
in good agreement with the data. The Gini coefficient, 
shown in the inset of Fig. [TT] is close to the calculated 
value of 37.5%. Stability of inc ome distribut ion until the 
late 1980s was pointed out by [Levy (19871, but a very 
different explanation of this stability was proposed. 

The exponential distribution of wages ^ was derived 
assuming random realizations of all possible configura- 
tions in the absence of specific regulations of the labor 
market. As such, it can be taken as an idealized limiting 
case for comparison with actual distributions in differ- 
ent countries. If specific measures are implemented for 
income redistribution, then the distribution may deviate 
from the exponential one. An example of such a redistri- 
bution was studied by |Banerjee, Y akovenko, and Di Mat- 



teo| ( 2006 1 for Australia, where P(r) has a sharp peak at 
a certain income stipulated by government policies. This 
is in contrast to the data presented in this section, which 
indicate that income distribution in the USA is close to 
the idealized one for a labor market without regulation. 
Numerous income distribution studies for other countries 
are cited in the review paper by |Yakovenko and Rosser| 
p009l. 



VII. PROBABILITY DISTRIBUTION OF ENERGY 
CONSUMPTION 

So far, we discussed how monetary inequality devel- 
ops for statistical reasons. Now let us discuss physical 
aspects of the economy. Since the beginning of the in- 
dustrial revolution, rapid technological development of 
human society has been based on consumption of fossil 
fuel, such as coal, oil, and gas, accumulated in the Earth 
for billions of years. As a result, physical standards of 
living in modern society are primarily determined by the 
level of energy consumption per capita. Now it becomes 
exceedingly clear that fossil fuel will be exhausted in the 
not-too-distant future. Moreover, consumption of fossil 
fuel releases CO2 into the atmosphere and affects global 
climate. These pressing global problems pose great tech- 



nolog ical and social challenges ( Rezai, Foley, and Taylor 
20121. 



Energy consumption per capita varies widely around 
the globe. This heterogeneity is a challenge and a compli- 
cation for reaching a global consensus on how to deal with 
the energy problems. Thus, it is important to understand 
the origin of the global inequality in energy consumption 
and characterize it quantitatively. Here I approach this 
problem using the method of maximal entropy. 

Let us consider an ensemble of economic agents and 
characterize each agent j by the energy consumption Cj 
per unit time. Notice that here €j denotes not energy, 
but power, which is measured in kilo Watts (kW). Simi- 
larly to Sec. [TTJ let us introduce the probability density 
P(e), so that P(e) de gives the probability to have energy 
consumption in the interval from e to e + de. Energy 
production, based on extraction of fossil fuel from the 
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World distribution of energy consumption 



Inequality in the global energy consumption 
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FIG. 12 Cumulative distribution functions C(e) for the en- 
ergy consumption per capita around the world for 1990, 2000, 
and 2005. The solid curve is the exponential function ( |22[ ) 
with the parameter T e = (e) = 2.2 kW. 



Earth, is a physically limited resource, which is divided 
for consumption among the global population. It would 
be very improbable to divide this resource equally. Much 
more likely, this resource would be divided according to 
the entropy maximization principle, subject to the global 
energy production constraint. Following the same proce- 
dure as in Sec. [IT] we arrive to the conclusion that P(e) 
should follow the exponential law analogous to Eq. |5| 



P(e) 



e/T. 



(22) 



Here c is a normalization constant, and the temperature 
T e = (e) is the average energy consumption per capita. 

Using the data from the W orld Resources Institute, 
Banerjee and Yakovenko ( 2010 ) constructed the probabil- 
ity distribution of energy consumption per capita around 
the world and found that it approximately follows the 
exponential law, as shown in Fig. 12 The world av- 
erage energy consumption per capita is (e) = 2.2 kW, 
compared with 10 kW in the USA and 0.6 kW in India 
(Banerjee and Yakovenko 2010). However, if India and 



other developing countries were to adopt the same energy 
consumption level per capita as in the USA, there would 
be not enough energy resources in the world to do that. 
One can argue that the global energy consumption in- 
equality results from the constraint on energy resources, 
and the global monetary inequality adjusts accordingly 
to implement this constraint. Since energy is purchased 
with money, a fraction of the world population ends up 
being poor, so that their energy consumption stays lim- 
ited. 

Fig. [13] shows the Lorenz curves for the global energy 
consumption per capita in 1990 , 2000, and 2005 from 
Banerjee and Yakovenko (20101. The black solid line 
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is the theoretical Lorenz curve ( 19 1 for the exponential 



distribution (22 I. In the first approximation, the cmpir- 
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FIG. 13 Lorenz curves for the energy consumption per capita 
around the world in 1990, 2000, and 2005, compared with the 
Lorenz curve ( |19[ ) for the exponential distribution. 



ical curves are reasonably close to the theoretical curve, 
but some deviations are clearly visible. On the Lorenz 
curve for 1990, there is a kink or a knee indicated by the 
arrow, where the slope of the curve changes appreciably. 
This point separates developed and developing countries. 
Mexico, Brazil, China, and India are below this point, 
whereas Britain, France, Japan, Australia, Russia, and 
USA are above. The slope change of the Lorenz curve 
represents a gap in the energy consumption per capita 
between these two groups of countries. Thus, the phys- 
ical difference between developed and developing coun- 
tries lies in the degree of energy consumption and utiliza- 
tion, rather than in more ephemeral monetary measures, 
such as dollar income per capita. However, the Lorenz 
curve for 2005 is closer to the exponential curve, and the 
kink is less pronounced. It means that the energy con- 
sumption inequality and the gap between developed and 
developing countries have decreased, as also confirmed 
by the decrease in the Gini coefficient G listed in Fig. [13] 
This result can be attributed to the rapid globalization 
and stronger mixing of the world economy in the last 20 
years. However, the energy consumption distribution in 
a well-mixed globalized world economy approaches to the 
exponential one, rather than to an equal distribution. 

The inherent inequality of the global energy consump- 
tion makes it difficult for the countries at the opposite 
ends of the distribution to agree on consistent measures 
to address the energy and climate challenges. While not 
offering any immediate solutions, I would like to point out 
that renewable energy has different characteristics than 
fossil fuel. Because solar and wind energy is typically gen- 
erated and consumed locally and not transported on the 
global scale, it is not subject to the entropy-maximizing 
redistribution. Thus, a transition from fossil fuel to re- 
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newable energy gives a hope for achieving a more equal 
global society. 



VIII. CONCLUSIONS 

Statistical approach and entropy maximization are 
very general and powerful methods for studying big en- 
sembles of various nature, be it physical, biological, eco- 
nomical, or social. Duncan Foley is one of the pioneers of 
applying these methods in economics ( jFoiey]p94lp96l 
1999[). The exponential distribution of wages, predicted 



by the statistical equilibrium theory of a labor market 
( |Foley 1996[ ), is supported by empirical data on income 
distribution in the USA for the majority of population. 
In contrast, the upper tail of income distribution fol- 
lows a power law and expands dramatically during fi- 
nancial bubbles, which results in a significant increase 
of the overall income inequality. The two-class struc- 
ture of the American society is apparent in the plots of 
income distribution, where the lower and upper classes 
are described by the exponential and power-law distri- 
butions. The entropy maximization method also demon- 
strates how a highly unequal exponential probability dis- 
tribution of money among the initially equal agents is 
generated as a results of stochastic monetary transac- 
tions between the agents. These ideas also apply to 
the global inequality of energy consumption per capita 
around the world. The empirical data show convergence 
to the predicted exponential distribution in the process 
of globalization. Global monetary inequality may be a 
consequence of the constraint on global energy resources, 
because it limits energy consumption per capita for a 
large fraction of the world population. 
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